Cluster-wise assessment of cluster stability

نویسنده

  • Christian Hennig
چکیده

Stability in cluster analysis is strongly dependent on the data set, especially on how well separated and how homogeneous the clusters are. In the same clustering, some clusters may be very stable and others may be extremely unstable. The Jaccard coefficient, a similarity measure between sets, is used as a clusterwise measure of cluster stability, which is assessed by the bootstrap distribution of the Jaccard coefficient for every single cluster of a clustering compared to the most similar cluster in the bootstrapped data sets. This can be applied to very general cluster analysis methods. Some alternative resampling methods are investigated as well, namely subsetting, jittering the data points and replacing some data points by artificial noise points. The different methods are compared by means of a simulation study. A data example illustrates the use of the cluster-wise stability assessment to distinguish between meaningful stable and spurious clusters, but it is also shown that clusters are sometimes only stable because of the inflexibility of certain clustering methods.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Genetic diversity assessment in physic nut (Jatropha curcas L.)

Mahalanobis’ D-square (D2) statistics was applied to assess diversity in the 9 genotypes collectedof semi-arid region of India (7 genotypes from Gujarat and Rajasthan for normal toxic and two fromOrissa csmcri’s plantation of non toxic nature. These genotypes were grouped into five. Cluster I andIII had two genotypes, cluster II had three genotypes and cluster VI and V contributed as solitaryge...

متن کامل

Assessment of Industrial Cluster with Value-Chain DEA model

  Every country actively and initiatively takes all kinds of policy steps to improve the international competitiveness of its industries and therefore improve national integrated competitiveness for the purpose of gaining economic benefits under the context of economic globalization. Therefore, studies of moreover, projects launched by the cluster had to be endorsed by the growers and one crit...

متن کامل

Microsatellite Markers to Complement Distinctness, Uniformity, Stability Testing of Brassica chinensis (Xiao Baicai) Varieties

Brassica chinensis varieties are mostly consumed as vegetables. In order to assess utility of microsatellite markers for testing distinctness, uniformity and stability of B. chinensis varieties, we used nine polymorphic microsatellite marker loci (with a total of 131 alleles) to evaluate four open pollinated and four hybrid varieties of B. chinensis. Each variety was represented by 48 randomly ...

متن کامل

Evaluation of Chickpea (Cicer arietinum L.) Genotypes for Cold Resistance in Autumn Cultivation in Rafsanjan Region of Iran

Pea cultivation in autumn has a higher yield compared to its spring cultivation. In order to investigate the possibility of changing the planting date of chickpea from spring to autumn in (Kerman province, Iran) and obtaining cultivars that could withstand cold winter and produce more yield, 63 chickpea genotypes were planted in pots at Dec/2016 and evaluated in field .The  agronomic traits inc...

متن کامل

Test-retest assessment of independent component analysis-derived resting-state functional connectivity based on functional near-infrared spectroscopy

Recent studies of resting-state functional near-infrared spectroscopy (fNIRS) have emerged as a hot topic and revealed that resting-state functional connectivity (RSFC) is an inherent characteristic of the resting brain. However, it is currently unclear if fNIRS-based RSFC is test-retest reliable. In this study, we utilized independent component analysis (ICA) as an effective RSFC detection too...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computational Statistics & Data Analysis

دوره 52  شماره 

صفحات  -

تاریخ انتشار 2007